Efficiently contractable quantum circuits cannot produce much entanglement 
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We show a similarity between two different classical simulation methods for measurement based 
quantum computation - one relying on a low entanglement (tree tensor network) representation of 
the computer's state, and the other a tensor contraction method based on the topology of the graph 
state. We use this similarity to show that any quantum circuit that can be efficiently simulated via 
tensor contraction cannot produce much entanglement. 



Characterizing classes of quantum computations that 
can be simulated efficiently on a classical computer is 
one of the most useful ways for investigating the power 
of quantum computation. In particular classical simula- 
tions can be used to identify the crucial properties which 
are responsible for the advantage that quantum com- 
putation has over classical computation. One property 
that has been shown as essential for a valuable quantum 
computation is entanglement (measured by the Schmidt 
number). Any computation that does not produce much 
entanglement, in the sense that the state of the com- 
puter, at any stage of the computation, can be described 
through a sequence of Schmidt decompositions for which 
the Schmidt number is bounded by poly(n) (n - the num- 
ber of qubits), can be efficiently simulated on a classical 
computer P, Q- 

One method for simulating quantum circuits which 
does not make use of a low-entanglement representation 
of the computer's state is tensor contraction due Markov 
and Shi 3 ■ This method relies solely on the topology of 
the circuit, or more accurately, of the graph created by 
representing the gates as vertices and the qubit wires as 
edges. It was shown that a number of families of quantum 
circuits including log-depth circuits with nearest neigh- 
bor interaction [3| and the approximate quantum Fourier 
transform (which was not known to be efficiently sim- 
ulable by any other method), can be efficiently simu- 
lated by this method [1, 0] . The relation between low- 
entanglement based methods and tensor contraction was 
not completely clear. In particular, the question how 
much entanglement can circuits whose topology allows 
for an efficient classical simulation produce was until now 
open. 

In this work we first consider classical simulations of 
measurement based quantum computation (MQC). We 
show a similarity between the tree tensor network (TTN) 
simulation method of 0, 0] - which is a low- entanglement 
method based on a sequence of Schmidt decompositions 
(relying on low enough Schmidt numbers) - and a ver- 
sion of the tensor contraction method, as defined in [3], 
for MQC. We then use this similarity to show that any 
quantum circuit which is efficiently contractable cannot 
produce much entanglement. Namely, during the entire 
computation the state of the computer has an efficient 
representation (as a TTN) in terms of as sequence of 



Schmidt decompositions. 

In the methods above (as in all other simulation meth- 
ods of quantum computation that we know of) one sim- 
ulates a quantum computation through sampling in a 
qubit by qubit manner. Explicitly, one computes the 
probabilities for the output measurement on one qubit 
and samples from them, then one computes the condi- 
tional probabilities for a measurement on a second qubit 
given that the sampled outcome for the first qubit had 
been obtained and so on. At each stage one computes 
the conditional probabilities for a measurement on one 
qubit given that previously sampled outcomes had been 
obtained. At the end of the process one obtains an out- 
come, of the entire computation, with the same proba- 
bility as the quantum computer. 

Let us first consider the simulation method based on 
the representation of the system as a TTN. This represen- 
tation of a quantum state is a generalization of Vidal's 
l] matrix product state (MPS) representation. In the 
MPS representation of a system of n qubits one consid- 
ers n — 1 Schmidt decompositions according to a chosen 
ordering (that is, one considers the first qubit against 
the rest then the first two qubit against the rest and so 
on), whereas in the TTN representation one considers a 
sequence of Schmidt decompositions along a chosen tree 
structure. Explicitly, given an n qubit system we con- 
sider a tree which has these qubits as its leaves as in 
Fig. [TJ Removing an edge from the tree will induce a 
partition of the tree into two sub-trees. The Schmidt 
decompositions we consider in a TTN are those corre- 
sponding to partitions induced by removing one internal 
edge of such a tree graph. In [1, 0| a subcubic tree was 
considered where all vertices, except the leaves, including 
the root have three edges. Here we shall turn this into a 
binary tree by adding a degree two vertex proportional 
to the identity (which will be the new root) to one of the 
edges connected to the original root. In a TTN repre- 
sentation a tensor is associated with each vertex of the 
tree. The indices of these tensors are associated with the 
edges connected to the vertex. Therefore in our TTN ev- 
ery tensor (except the root) would be of rank 3. Indices 
associated with edges connected to the leaves correspond 
to the value of the qubit in the computational basis. So 
that given an n qubit state and a binary tree structure 
on that state, the coefficients of the computational basis 
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FIG. 1: A tree structure on a 5 qubits state. The partition 
corresponding to the edge a is shown. 



expansion of the state are given in terms of a contraction 
over these tensors. For example, a TTN representation 
of the state depicted in Fig. [T] would be 

^ ^ ^ ^ ^ii,i2,c(^j3,a,/3^/3,7^i4,i5,7 Kl ' ■ ' ^s) (1) 
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where the i indices are the value of the qubit (in the 
computational basis) and the Greek indices correspond to 
the internal edges of the tree and thus correspond to the 
Schmidt basis of the partition induced by this edge. So 
that, for instance, the Schmidt vectors corresponding to 
the decomposition along the edge a (up to normalization) 
would be 

|^[3,4,5]^^ E ^,3,„,;3A;3,7^»4,»5,7K3,«4,i5) • (2) 
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Denoting the largest Schmidt number for a certain tree 
by X, the number of complex parameters describing the 
state is of the order Oinx")- 

It was shown 0] that using a TTN representation the 
response of the the system to local operations and clas- 
sical communication (LOCC) as well as two-qubit oper- 
ations can be efhciently simulated classically. Therefore, 
one can efficiently simulate any computation that does 
not generate much entanglement (where x is bounded by 
poly(n)). Furthermore, since in MQC the computation is 
performed by LOCC that can not increase the entangle- 
ment, any MQC can be efficiently simulated given that 
the initial state has an efficient TTN representation. 

Let us now consider the tensor contraction method for 
simulating quantum circuits 0, In this method we 
first associate a graph with the circuit, representing each 
input qubit, gate and output measurement by a vertex, 
and each wire by an edge (e.g. a two qubit gate would 
correspond to a vertex of degree four). We then label 
each edge with a different index. Finally, to each vertex 
we associate a tensor describing the operation performed 
at that point. This tensor has indices corresponding to 
all edges connected to that vertex (so that its rank is 
equal to the degree of the vertex). The probability to 
obtain a certain outcome for some measurement is given 



by contracting these tensors, (that is, taking the product 
of all these tensors and summing over all indices) . 

Of course, one cannot efficiently sum over all indices 
at the same time as there are exponentially many terms. 
To avoid this one contracts the tensors together one at a 
time - in each step of the computation one replaces two 
existing tensors with a new one obtained by summing 
over any joint indices. We repeat this procedure until we 
are left with a single tensor with no free indices, which is 
the desired probability. The key element of the method 
is the ordering of the contraction. The aim is to order 
the contractions so that one never generate tensors with 
too many indices during this process. 

The contraction process can be described [l] by a se- 
quence of sets of vertices S — (s^ , . . . , s^) - each of which 
corresponds to a particular tensor that is generated dur- 
ing the computation. Each set S 5 is either the union 
of two previous sets, or one previous set and a vertex, or 
two vertices. Denoting the set of all vertices by V: 

I r^i , , -lii 1 either V- = , k <i , 

s* = {f Uin} where J , ' (3) 

^ ^ or tj ^ {v} , V eV. ^ ' 

The calculation of the probability is done in N steps, 
where in step i we compute a new tensor by summing over 
all indices corresponding to edges connecting t\ to t\. For 
the computation to be complete, we require that the final 
set — V . The number of edges connecting vertices 
in to vertices outside s* [U] is the rank of the tensor 
corresponding to s*. The simulation corresponding to the 
sequence S will be an efficient one if = max^ U — 

O(logn). 

In what follows we consider simulations of MQC. The 
state used as the resource for MQC is the graph state, 
which we consider here as any state that is generated by 
taking a set of qubits with each of them initially in the 
state |-|-) and applying cphase operations to any number 
of pairs of qubits (the order in which these operations 
are applied is irrelevant, one would always end up in the 
same state) we will refer to these operations as cphase 
connections). The underlying graph (G) of such a graph 
state is obtained by associating a vertex to each qubit 
and an edge to each cphase connection. 

A natural way of defining a tensor contraction scheme 
for MQC is by using PEPS [8]. In such a scheme a ten- 
sor is associated with each of the qubits in the graph 
state. Such a tensor corresponding to a particular qubit 
would have an index for each cphase connection plus an 
additional index for the value of the qubit. The PEPS 
representation of a graph state would be of the form 

I*) - E E^w.---^wJ^i---^")' (4) 

{a} 

where {a} donates the set of all the indices corresponding 
to cphase connections in the graph, {a}j denotes the 
subset of indices corresponding to cphase connections of 
qubit j. 
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In order to simulate MQC on a graph state it is enough 
to calculate the probabilities for single qubit measure- 
ments. In more general terms, we consider a measure- 
ment described by a set of POVM elements {Er} each 
composed of single qubit POVM elements. Namely, 
Er = ii^n ■ ■'S) Er„ where rj represents an outcome of a 
single qubit measurement. For a projective measurement 
(on qubit j) Er- = and for an unmeasured qubit 

we define Er^ := /. The probability {P{r)) for obtaining 
a certain outcome r = ri • • • r„ is given by tr(Er \'ip){ip\), 
and from ^ we have. 

P{r)= y Sfl?, , ; •••M"w ; (5) 

^ ' /-^ {"}i{a}i' {"}2{a}2' ^ ' 

where 

{a}j{a}y {a}, {a}y\jl r., \ j / 

The probability is, therefore, given in terms of a tensor 
contraction scheme on the same graph as the underlying 
G, where a tensor B^''^ is associated with the vertex cor- 
responding to qubit k. Note that in this tensor network 
there is a pair of joint indices for each edge in the graph - 
one from the subset {a} and the corresponding one from 
{a}' - we can treat the pair as a single index admitting 
four values. The above expression ([5]) for the probabil- 
ity can be obtained by placing the graph associated with 
the bra {tp] and its tensor network on 'top' of the graph 
of IV') (the two graphs are similar only the associated 
tensor network of the first is the complex conjugate of 
the other) and summing over the indices of the qubits 
(the ifc's) with the appropriate measurement elements in 
between. 

Once we have expressed the probabilities in terms of a 
contraction scheme over a graph (namely, the probability 
is given by contracting a set tensors where each tensor is 
associated with a vertex and each index of these tensors 
is associated with an edge) then all the results obtained 
in Q and Q automatically apply here. 

The key point here is that a sequence S defined on 
a system of qubits is completely equivalent to a binary 
tree structure T on the same system - S defines a tree 
T and vice versa. Any subset s*' G 5 of vertices (which 
now corresponds to some set of qubits) corresponds to a 
subtree of T. 

What can we learn from the above equivalence? A 
bipartition of the system corresponding to an internal 
edge e in the tree T (that is the partition induced by 
removing e), is simply a partition to a set in S and the 
rest of the system. Let us denote the Schmidt number 
corresponding to this bipartition by Xe and the number 
of cphase-connections between qubits corresponding to 
vertices in the set and the rest of the system (or in other 
words the number of edges in G connecting vertices in 
the set to vertices outside the set) by L^- Clearly Lg > 



log2 Xe ■ As this is true for any e S T it is certainly 
true for the one with the maximal Schmidt number. The 
edge in T for which we obtain the maximal might be 
a different one but obviously the corresponding partition 
cannot have less cphase-connections. Thus, for any graph 
state and any tree structure on it T we have 

maxLe(|^),r) > maxlogaXe'd^),?') (6) 

The minimum of the left hand side over all possible trees 
is the contraction complexity of the underlying graph G 
denoted by cc(G), which determines the complexity of 
the best tensor contraction scheme on the graph state. 
Classically simulating any computation performed on lib) 
would require poly(rt, 2'^'^^'^^) computational resources [3|]. 
The minimum of the right hand side of ^ over all pos- 
sible trees is the Schmidt-rank width (or x-width) of the 
graph state - Xiiid(|'0))j which determines the size of the 
most efficient TTN description of the graph state (re- 
quiring 0{n2^^""') complex parameters). Classical simu- 
lation of any computation performed on this state would 
require poly(n, 2^™'*) computational resources [1, 0]- As 
© applies to any tree structure T it applies also to the 
particular tree for which the minimum of the left hand 
side is obtained. The minimum of the right hand side 
therefore cannot be bigger. Thus 

cc(G) > xUm) (7) 

Hence, if we have an efficient tensor contraction simula- 
tion of MQC then we are assured that there is an effi- 
cient simulation of the same computation in terms of a 
TTN. Note that the opposite claim is not true. Indeed, 
the number of cphase-connections between two parts of 
a graph state gives an upper bound to log2 Xe, however 
there can be graph states where the number of connec- 
tions greatly exceeds log2 Xe- The fully connected graph, 
for example, where each of the qubits is connected to all 
the rest, has Schmidt number 2 for any possible biparti- 
tion of the state. 

Yet, the graph states that one usually considers as a re- 
source for quantum computation are those without much 
excess of cphasc connections. In particular, graph states 
where the number of connections per qubit (the degree 
of the vertices) is bounded by some constant - A. In 
that case, for a bipartition of the system corresponding 
to edge e, into a subtree (A) and the rest of the sys- 
tem (B), the maximal number of qubits in A connected 
to qubits in B is Alog2Xe- We can verify this by the 
following procedure. We construct a sequence of sets of 
qubits (Fi, F2, • • ■ ). Fi includes a single qubit in B and 
all the qubits that are connected to it in A. F2 consists 
of a different qubit in B and all the qubits that are con- 
nected to it in A, except those that are already included 
in Fi and so on. We keep constructing such sets until 
each of the qubits in A that are connected to qubits in B 
is included in one of those sets. We now undo all cphase 
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connections within A and B (not affecting the entangle- 
ment between the two sides). Next we measure (in the z 
basis) all the qubits in i^i n ^ save one, so that we are 
left with a maximally entangled pair, where the qubit in 
FiHB is connected only to its counterpart in Fi n^. The 
qubit in Fi n ^ might be connected also to other qubits 
in B however these connections can be undone by lo- 
cal complementation Q (i.e. by local Clifford operations) 
and unitary operation local to B. Thus, by using op- 
erations which could only decrease the entanglement we 
have produced out of the first set a maximally entangled 
pair with no connections to other qubits. At that stage 
the qubit in F2 H B is connected only to qubits within 
F2 , therefore we can repeat the procedure and produce a 
maximally entangled pair also from this set. We proceed 
in the same way generating a maximally entangled pair 
from each set, each carrying exactly one ebit. Clearly, 
the maximal number of pairs, and therefore of sets, can- 
not be greater than log2 Xe ■ As the maximal number of 
qubits from A in each set is A we get the above bound. 
The upper bound on is thus log2 Xa.b since each 
qubit in A is connected to at most A qubits in B. Since 
this bound applies to any bipartition it also applies to the 
one for which Le is maximal for a given tree. Obviously, 
the maximal Xe for this tree can only be greater, thus 

maxie(|^),T) < A2maxlog2Xe'(|V'),r) (8) 

eGT e £T 

Using the same arguments as above we can take a min- 
imum over all trees for both sides of the inequality. In- 
cluding ^ we therefore have 

xum) < cc{G) < A\um (9) 

Showing the equivalence, up to a constant, of both meth- 
ods of simulations of MQC for any G of maximal degree 
A. A topological parameter which determines cc{G) is 
the tree width of G (twd(G)). In [3] upper and lower 
bounds to cc(G) were given in terms of twd(G). Xwd{G) 
is equal to a different parameter of the graph - the rank 
width of G (rwd(G)) @, [l^. Using the bounds in [3] 
the above inequalities can be written in terms of these 
parameters. For any graph G we have 

^"^"^l?,'^ < rwd(G) < A(twd(G) + 1) - 1 
2A^ 

So far we have considered simulation of quantum com- 
putation on graph states. Let us now consider quantum 
circuits. Given a quantum circuit (C) we can on one hand 
define a a tensor contraction scheme as in 0, 0, Q where 
the corresponding graph (Gc) is constructed by associat- 
ing a vertex to each circuit element. On the other hand 
we can perform the same computation using MQC on a 
graph state. The standard way to construct a graph state 
for a given circuit is by associating a chain of qubits for 
each logical qubit along which the data would progress 



via single qubit measurements while undergoing one and 
two qubit gate applications. For each single qubit gate 
up to three qubits should be introduced to the chain and 
two qubit gates are realized by introducing cphase con- 
nections between the two corresponding chains. We de- 
note, as before, the underlying graph of the graph state 
by G. 

Clearly Gc is not identical to G. One difference is that 
in G a sequence of up to three vertices of degree 2 may 
correspond to a single one-qubit gate (which corresponds 
to one such vertex in Gc). The second difference is that 
a two qubit gate is associated with a single vertex of de- 
gree 4 while in G it would correspond to two vertices 
of degree 3 with an edge between them. Clearly, the 
additional vertices of degree 2 have no effect on the con- 
traction complexity of G - any degree 2 vertex can be 
contracted together with a neighboring vertex or a set of 
vertices without changing its number of outgoing edges. 
Since the two vertices of degree 3 can be combined to 
form a set with four outgoing edges (representing a rank 
4 tensor just as the corresponding vertex in Gc), it is 
clear that 

cc(G) < cc{Gc) 

Hence, if a quantum circuit has an efficient tensor con- 
traction scheme simulating it then we are assured that so 
would the MQC version of that circuit on a graph state, 
and from ([7]) we know that the this graph state would 
have an efficient TTN representation and consequently 
an efficient TTN-based classical simulation. Moreover, 
the fact that any such circuit has an efficient TTN repre- 
sentation also tells us that such a circuit cannot produce 
much entanglement. The state of the computer at any 
stage of the computation would have an efficient TTN 
representation. 

In order to see this we note that the state of the com- 
puter at a certain stage of the computation is the state 
produced by applying a sub-circuit of G (Gi) to the in- 
put. In order to produce the same state using our graph 
state we introduce new 'output' qubits in the graph state 
corresponding to G. These are additional qubits inserted 
in the chains just after the sub-graph state corresponding 
to Gi, which function as links connecting this sub-graph 
to the rest of the system (they have no vertical cphase 
connections between different chains). The reason we in- 
troduced these new qubits is that the qubits that immedi- 
ately follow Gi might have additional connection between 
the chains. Obviously the vertices corresponding to the 
new output qubits are of degree 2 so that if the original 
graph had an efficient TTN representation so would the 
new graph with the additional qubits. Now, the state 
of the quantum computer after applying Gi would be 
the state (up to local phase corrections) of the new out- 
put qubits after measuring the qubits corresponding to 
Gi according to the circuit, and measuring the rest of 
the qubits in the z basis. As single qubit measurements 
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cannot increase the Schmidt rank the new output qubits 
would also have an efficient TTN representation. 

It should be noted that the opposite claim is not true. 
There are circuits which do not produce much entan- 
glement but do not have an efficient tensor contraction 
scheme. Obvious examples are classical circuits (with 
'classical' computational basis input and output measure- 
ments) such as modular exponentiation, which do not 
produce any entanglement and yet are not likely to have 
an efficient tensor contraction scheme 
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